Overview

Dataset info

Number of variables21
Number of observations21613
Missing cells5 (< 0.1%)
Duplicate rows0 (0.0%)
Total size in memory4.8 MiB
Average record size in memory232.0 B

Variables types

NUM19
BOOL1
CAT1

Reproduction info

Date of analysis2020-05-28 01:24:18.314874
Versionpandas-profiling v2.4.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configurationconfig.yaml

Warnings

date has a high cardinality: 372 distinct values Warning
sqft_basement has 13126 (60.7%) zeros Zeros
view has 19489 (90.2%) zeros Zeros
yr_renovated has 20699 (95.8%) zeros Zeros

Variables

bathrooms
Real number (ℝ≥0)

Distinct count30
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.114757322
Minimum0
Maximum8
Zeros10
Zeros (%)< 0.1%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11.75
median2.25
Q32.5
95-th percentile3.5
Maximum8
Range8
Interquartile range (IQR)0.75

Descriptive statistics

Standard deviation0.7701631572
Coefficient of variation (CV)0.3641851238
Kurtosis1.279902444
Mean2.114757322
Median Absolute Deviation (MAD)0.6153609574
Skewness0.5111075733
Sum45706.25
Variance0.5931512887
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0. 0.625 0.875 1.125 1.375 ... 4.125 4.625 5.375 6.125 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2.5 5380 24.9%
 
1 3852 17.8%
 
1.75 3048 14.1%
 
2.25 2047 9.5%
 
2 1930 8.9%
 
1.5 1446 6.7%
 
2.75 1185 5.5%
 
3 753 3.5%
 
3.5 731 3.4%
 
3.25 589 2.7%
 
Other values (20) 652 3.0%
 
ValueCountFrequency (%) 
0 10 < 0.1%
 
0.5 4 < 0.1%
 
0.75 72 0.3%
 
1 3852 17.8%
 
1.25 9 < 0.1%
 
ValueCountFrequency (%) 
8 2 < 0.1%
 
7.75 1 < 0.1%
 
7.5 1 < 0.1%
 
6.75 2 < 0.1%
 
6.5 2 < 0.1%
 

bedrooms
Real number (ℝ≥0)

Distinct count14
Unique (%)0.1%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean3.370910269
Minimum0
Maximum33
Zeros13
Zeros (%)0.1%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile2
Q13
median3
Q34
95-th percentile5
Maximum33
Range33
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9300844679
Coefficient of variation (CV)0.2759149291
Kurtosis49.06741079
Mean3.370910269
Median Absolute Deviation (MAD)0.7349766366
Skewness1.974439161
Sum72842
Variance0.8650571175
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3 9822 45.4%
 
4 6881 31.8%
 
2 2759 12.8%
 
5 1601 7.4%
 
6 272 1.3%
 
1 199 0.9%
 
7 38 0.2%
 
8 13 0.1%
 
0 13 0.1%
 
9 6 < 0.1%
 
Other values (3) 5 < 0.1%
 
(Missing) 4 < 0.1%
 
ValueCountFrequency (%) 
0 13 0.1%
 
1 199 0.9%
 
2 2759 12.8%
 
3 9822 45.4%
 
4 6881 31.8%
 
ValueCountFrequency (%) 
33 1 < 0.1%
 
11 1 < 0.1%
 
10 3 < 0.1%
 
9 6 < 0.1%
 
8 13 0.1%
 

condition
Real number (ℝ≥0)

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.40942951
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile3
Q13
median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.6507430464
Coefficient of variation (CV)0.1908656696
Kurtosis0.5257635653
Mean3.40942951
Median Absolute Deviation (MAD)0.5607190317
Skewness1.032804637
Sum73688
Variance0.4234665124
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3 14031 64.9%
 
4 5679 26.3%
 
5 1701 7.9%
 
2 172 0.8%
 
1 30 0.1%
 
ValueCountFrequency (%) 
1 30 0.1%
 
2 172 0.8%
 
3 14031 64.9%
 
4 5679 26.3%
 
5 1701 7.9%
 
ValueCountFrequency (%) 
5 1701 7.9%
 
4 5679 26.3%
 
3 14031 64.9%
 
2 172 0.8%
 
1 30 0.1%
 

date
Categorical

HIGH CARDINALITY
Distinct count372
Unique (%)1.7%
Missing0
Missing (%)0.0%
Memory size169.0 KiB
20140623T000000
 
142
20140626T000000
 
131
20140625T000000
 
131
20140708T000000
 
127
20150427T000000
 
126
Other values (367)
20956
ValueCountFrequency (%) 
20140623T000000 142 0.7%
 
20140626T000000 131 0.6%
 
20140625T000000 131 0.6%
 
20140708T000000 127 0.6%
 
20150427T000000 126 0.6%
 
20150325T000000 123 0.6%
 
20150428T000000 121 0.6%
 
20150422T000000 121 0.6%
 
20150414T000000 121 0.6%
 
20140709T000000 121 0.6%
 
Other values (362) 20349 94.2%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceFalse
Contains non-wordsFalse

Length

Max length15
Mean length15
Min length15
Scatter

floors
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.494331853
Minimum1
Maximum3.5
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
median1.5
Q32
95-th percentile2
Maximum3.5
Range2.5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.539990919
Coefficient of variation (CV)0.3613594383
Kurtosis-0.4847811464
Mean1.494331853
Median Absolute Deviation (MAD)0.4885221039
Skewness0.616106725
Sum32295.5
Variance0.2915901926
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 10679 49.4%
 
2 8241 38.1%
 
1.5 1910 8.8%
 
3 613 2.8%
 
2.5 161 0.7%
 
3.5 8 < 0.1%
 
(Missing) 1 < 0.1%
 
ValueCountFrequency (%) 
1 10679 49.4%
 
1.5 1910 8.8%
 
2 8241 38.1%
 
2.5 161 0.7%
 
3 613 2.8%
 
ValueCountFrequency (%) 
3.5 8 < 0.1%
 
3 613 2.8%
 
2.5 161 0.7%
 
2 8241 38.1%
 
1.5 1910 8.8%
 

grade
Real number (ℝ≥0)

Distinct count12
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.656873178
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum1
5-th percentile6
Q17
median7
Q38
95-th percentile10
Maximum13
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.175458757
Coefficient of variation (CV)0.1535168116
Kurtosis1.190932077
Mean7.656873178
Median Absolute Deviation (MAD)0.929600303
Skewness0.7711032008
Sum165488
Variance1.381703289
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 1. 3.5 4.5 5.5 6.5 ... 9.5 10.5 11.5 12.5 13. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7 8981 41.6%
 
8 6068 28.1%
 
9 2615 12.1%
 
6 2038 9.4%
 
10 1134 5.2%
 
11 399 1.8%
 
5 242 1.1%
 
12 90 0.4%
 
4 29 0.1%
 
13 13 0.1%
 
Other values (2) 4 < 0.1%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
3 3 < 0.1%
 
4 29 0.1%
 
5 242 1.1%
 
6 2038 9.4%
 
ValueCountFrequency (%) 
13 13 0.1%
 
12 90 0.4%
 
11 399 1.8%
 
10 1134 5.2%
 
9 2615 12.1%
 

id
Real number (ℝ≥0)

Distinct count21436
Unique (%)99.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4580301521
Minimum1000102
Maximum9900000190
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum1000102
5-th percentile512480335
Q12123049194
median3904930410
Q37308900445
95-th percentile9297300429
Maximum9900000190
Range9899000088
Interquartile range (IQR)5185851251

Descriptive statistics

Standard deviation2876565571
Coefficient of variation (CV)0.6280297396
Kurtosis-1.260541871
Mean4580301521
Median Absolute Deviation (MAD)2543592458
Skewness0.2433285476
Sum9.899405677e+13
Variance8.274629486e+18
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.00010200e+06 7.60006100e+06 7.60013050e+06 1.15005650e+07 1.15205050e+07 ... 9.83930020e+09 9.83930111e+09 9.84230007e+09 9.84230051e+09 9.90000019e+09], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
795000620 3 < 0.1%
 
2206700215 2 < 0.1%
 
643300040 2 < 0.1%
 
3333002450 2 < 0.1%
 
1995200200 2 < 0.1%
 
1781500435 2 < 0.1%
 
3904100089 2 < 0.1%
 
3323059027 2 < 0.1%
 
6300000226 2 < 0.1%
 
9809000020 2 < 0.1%
 
Other values (21426) 21592 99.9%
 
ValueCountFrequency (%) 
1000102 2 < 0.1%
 
1200019 1 < 0.1%
 
1200021 1 < 0.1%
 
2800031 1 < 0.1%
 
3600057 1 < 0.1%
 
ValueCountFrequency (%) 
9900000190 1 < 0.1%
 
9895000040 1 < 0.1%
 
9842300540 1 < 0.1%
 
9842300485 1 < 0.1%
 
9842300095 1 < 0.1%
 

lat
Real number (ℝ≥0)

Distinct count5034
Unique (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.56005252
Minimum47.1559
Maximum47.7776
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum47.1559
5-th percentile47.3103
Q147.471
median47.5718
Q347.678
95-th percentile47.74964
Maximum47.7776
Range0.6217
Interquartile range (IQR)0.207

Descriptive statistics

Standard deviation0.1385637102
Coefficient of variation (CV)0.002913447377
Kurtosis-0.6763130016
Mean47.56005252
Median Absolute Deviation (MAD)0.1148297137
Skewness-0.4852704765
Sum1027915.415
Variance0.0191999018
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[47.1559 47.18955 47.19365 47.19585 47.2141 ... 47.70015 47.73735 47.74675 47.75945 47.7776 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47.5491 17 0.1%
 
47.6846 17 0.1%
 
47.6624 17 0.1%
 
47.5322 17 0.1%
 
47.6711 16 0.1%
 
47.6886 16 0.1%
 
47.6955 16 0.1%
 
47.686 15 0.1%
 
47.6647 15 0.1%
 
47.6904 15 0.1%
 
Other values (5024) 21452 99.3%
 
ValueCountFrequency (%) 
47.1559 1 < 0.1%
 
47.1593 1 < 0.1%
 
47.1622 1 < 0.1%
 
47.1647 1 < 0.1%
 
47.1764 1 < 0.1%
 
ValueCountFrequency (%) 
47.7776 3 < 0.1%
 
47.7775 3 < 0.1%
 
47.7774 1 < 0.1%
 
47.7772 3 < 0.1%
 
47.7771 2 < 0.1%
 

long
Real number (ℝ)

Distinct count752
Unique (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-122.2138964
Minimum-122.519
Maximum-121.315
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum-122.519
5-th percentile-122.387
Q1-122.328
median-122.23
Q3-122.125
95-th percentile-121.979
Maximum-121.315
Range1.204
Interquartile range (IQR)0.203

Descriptive statistics

Standard deviation0.1408283424
Coefficient of variation (CV)-0.001152310388
Kurtosis1.049500887
Mean-122.2138964
Median Absolute Deviation (MAD)0.1151608925
Skewness0.8850529834
Sum-2641408.943
Variance0.01983262202
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[-122.519 -122.466 -122.442 -122.4155 -122.4125 ... -121.7685 -121.7435 -121.6945 -121.411 -121.315 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-122.29 116 0.5%
 
-122.3 111 0.5%
 
-122.362 104 0.5%
 
-122.291 100 0.5%
 
-122.372 99 0.5%
 
-122.363 99 0.5%
 
-122.288 98 0.5%
 
-122.357 96 0.4%
 
-122.284 95 0.4%
 
-122.365 94 0.4%
 
Other values (742) 20601 95.3%
 
ValueCountFrequency (%) 
-122.519 1 < 0.1%
 
-122.515 1 < 0.1%
 
-122.514 1 < 0.1%
 
-122.512 1 < 0.1%
 
-122.511 2 < 0.1%
 
ValueCountFrequency (%) 
-121.315 2 < 0.1%
 
-121.316 1 < 0.1%
 
-121.319 1 < 0.1%
 
-121.321 1 < 0.1%
 
-121.325 1 < 0.1%
 

price
Real number (ℝ≥0)

Distinct count4028
Unique (%)18.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean540088.1418
Minimum75000
Maximum7700000
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum75000
5-th percentile210000
Q1321950
median450000
Q3645000
95-th percentile1156480
Maximum7700000
Range7625000
Interquartile range (IQR)323050

Descriptive statistics

Standard deviation367127.1965
Coefficient of variation (CV)0.6797542255
Kurtosis34.58554043
Mean540088.1418
Median Absolute Deviation (MAD)233941.7243
Skewness4.024069145
Sum1.167292501e+10
Variance1.347823784e+11
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 75000. 109750. 149950. 150275. 159997.5 ... 2002500. 2587500. 3202000. 3825000. 7700000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
450000 172 0.8%
 
350000 172 0.8%
 
550000 159 0.7%
 
500000 152 0.7%
 
425000 150 0.7%
 
325000 148 0.7%
 
400000 145 0.7%
 
375000 138 0.6%
 
300000 133 0.6%
 
525000 131 0.6%
 
Other values (4018) 20113 93.1%
 
ValueCountFrequency (%) 
75000 1 < 0.1%
 
78000 1 < 0.1%
 
80000 1 < 0.1%
 
81000 1 < 0.1%
 
82000 1 < 0.1%
 
ValueCountFrequency (%) 
7700000 1 < 0.1%
 
7062500 1 < 0.1%
 
6885000 1 < 0.1%
 
5570000 1 < 0.1%
 
5350000 1 < 0.1%
 

sqft_above
Real number (ℝ≥0)

Distinct count946
Unique (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1788.390691
Minimum290
Maximum9410
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum290
5-th percentile850
Q11190
median1560
Q32210
95-th percentile3400
Maximum9410
Range9120
Interquartile range (IQR)1020

Descriptive statistics

Standard deviation828.0909777
Coefficient of variation (CV)0.4630369538
Kurtosis3.402303621
Mean1788.390691
Median Absolute Deviation (MAD)640.3860357
Skewness1.446664473
Sum38652488
Variance685734.6673
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 290. 465. 575. 665. 695. ... 4505. 4865. 5485. 6690. 9410.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1300 212 1.0%
 
1010 210 1.0%
 
1200 206 1.0%
 
1220 192 0.9%
 
1140 184 0.9%
 
1400 180 0.8%
 
1060 178 0.8%
 
1180 177 0.8%
 
1340 176 0.8%
 
1250 174 0.8%
 
Other values (936) 19724 91.3%
 
ValueCountFrequency (%) 
290 1 < 0.1%
 
370 1 < 0.1%
 
380 1 < 0.1%
 
384 1 < 0.1%
 
390 2 < 0.1%
 
ValueCountFrequency (%) 
9410 1 < 0.1%
 
8860 1 < 0.1%
 
8570 1 < 0.1%
 
8020 1 < 0.1%
 
7880 1 < 0.1%
 

sqft_basement
Real number (ℝ≥0)

ZEROS
Distinct count306
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean291.5090455
Minimum0
Maximum4820
Zeros13126
Zeros (%)60.7%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3560
95-th percentile1190
Maximum4820
Range4820
Interquartile range (IQR)560

Descriptive statistics

Standard deviation442.5750427
Coefficient of variation (CV)1.518220616
Kurtosis2.715574211
Mean291.5090455
Median Absolute Deviation (MAD)363.2358668
Skewness1.577965056
Sum6300385
Variance195872.6684
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 5. 45. 95. 135. ... 1605. 1875. 2230. 2830. 4820.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 13126 60.7%
 
600 221 1.0%
 
700 218 1.0%
 
500 214 1.0%
 
800 206 1.0%
 
400 184 0.9%
 
1000 149 0.7%
 
900 144 0.7%
 
300 142 0.7%
 
200 108 0.5%
 
Other values (296) 6901 31.9%
 
ValueCountFrequency (%) 
0 13126 60.7%
 
10 2 < 0.1%
 
20 1 < 0.1%
 
40 4 < 0.1%
 
50 11 0.1%
 
ValueCountFrequency (%) 
4820 1 < 0.1%
 
4130 1 < 0.1%
 
3500 1 < 0.1%
 
3480 1 < 0.1%
 
3260 1 < 0.1%
 

sqft_living
Real number (ℝ≥0)

Distinct count1038
Unique (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2079.899736
Minimum290
Maximum13540
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum290
5-th percentile940
Q11427
median1910
Q32550
95-th percentile3760
Maximum13540
Range13250
Interquartile range (IQR)1123

Descriptive statistics

Standard deviation918.440897
Coefficient of variation (CV)0.4415794093
Kurtosis5.24309299
Mean2079.899736
Median Absolute Deviation (MAD)698.3239196
Skewness1.471555427
Sum44952873
Variance843533.6814
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 290. 510. 665. 695. 804.5 ... 4755. 5560. 6077.5 8015. 13540. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1300 138 0.6%
 
1400 135 0.6%
 
1440 133 0.6%
 
1010 129 0.6%
 
1660 129 0.6%
 
1800 129 0.6%
 
1820 128 0.6%
 
1480 125 0.6%
 
1720 125 0.6%
 
1540 124 0.6%
 
Other values (1028) 20318 94.0%
 
ValueCountFrequency (%) 
290 1 < 0.1%
 
370 1 < 0.1%
 
380 1 < 0.1%
 
384 1 < 0.1%
 
390 2 < 0.1%
 
ValueCountFrequency (%) 
13540 1 < 0.1%
 
12050 1 < 0.1%
 
10040 1 < 0.1%
 
9890 1 < 0.1%
 
9640 1 < 0.1%
 

sqft_living15
Real number (ℝ≥0)

Distinct count777
Unique (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1986.552492
Minimum399
Maximum6210
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum399
5-th percentile1140
Q11490
median1840
Q32360
95-th percentile3300
Maximum6210
Range5811
Interquartile range (IQR)870

Descriptive statistics

Standard deviation685.3913043
Coefficient of variation (CV)0.3450154512
Kurtosis1.59709581
Mean1986.552492
Median Absolute Deviation (MAD)536.2192073
Skewness1.108181276
Sum42935359
Variance469761.2399
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 399. 680. 829. 975. 994. ... 3755. 3995. 4325. 4945. 6210.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1540 197 0.9%
 
1440 195 0.9%
 
1560 192 0.9%
 
1500 181 0.8%
 
1460 169 0.8%
 
1580 167 0.8%
 
1610 166 0.8%
 
1800 166 0.8%
 
1720 166 0.8%
 
1620 165 0.8%
 
Other values (767) 19849 91.8%
 
ValueCountFrequency (%) 
399 1 < 0.1%
 
460 2 < 0.1%
 
620 2 < 0.1%
 
670 1 < 0.1%
 
690 2 < 0.1%
 
ValueCountFrequency (%) 
6210 1 < 0.1%
 
6110 1 < 0.1%
 
5790 6 < 0.1%
 
5610 1 < 0.1%
 
5600 1 < 0.1%
 

sqft_lot
Real number (ℝ≥0)

Distinct count9782
Unique (%)45.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15106.96757
Minimum520
Maximum1651359
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum520
5-th percentile1800
Q15040
median7618
Q310688
95-th percentile43339.2
Maximum1651359
Range1650839
Interquartile range (IQR)5648

Descriptive statistics

Standard deviation41420.51152
Coefficient of variation (CV)2.741815082
Kurtosis285.0778197
Mean15106.96757
Median Absolute Deviation (MAD)13837.26422
Skewness13.06001896
Sum326506890
Variance1715658774
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[5.200000e+02 6.755000e+02 8.635000e+02 1.154500e+03 1.351500e+03 ... 2.178025e+05 2.246055e+05 2.942475e+05 5.061020e+05 1.651359e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5000 358 1.7%
 
6000 290 1.3%
 
4000 251 1.2%
 
7200 220 1.0%
 
4800 120 0.6%
 
7500 119 0.6%
 
4500 114 0.5%
 
8400 111 0.5%
 
9600 109 0.5%
 
3600 103 0.5%
 
Other values (9772) 19818 91.7%
 
ValueCountFrequency (%) 
520 1 < 0.1%
 
572 1 < 0.1%
 
600 1 < 0.1%
 
609 1 < 0.1%
 
635 1 < 0.1%
 
ValueCountFrequency (%) 
1651359 1 < 0.1%
 
1164794 1 < 0.1%
 
1074218 1 < 0.1%
 
1024068 1 < 0.1%
 
982998 1 < 0.1%
 

sqft_lot15
Real number (ℝ≥0)

Distinct count8689
Unique (%)40.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12768.45565
Minimum651
Maximum871200
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum651
5-th percentile1999.2
Q15100
median7620
Q310083
95-th percentile37062.8
Maximum871200
Range870549
Interquartile range (IQR)4983

Descriptive statistics

Standard deviation27304.17963
Coefficient of variation (CV)2.138408933
Kurtosis150.76311
Mean12768.45565
Median Absolute Deviation (MAD)10118.66071
Skewness9.506743247
Sum275964632
Variance745518225.3
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[6.510000e+02 9.145000e+02 1.056500e+03 1.168000e+03 1.279500e+03 ... 2.177945e+05 2.180110e+05 2.245555e+05 4.364705e+05 8.712000e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5000 427 2.0%
 
4000 357 1.7%
 
6000 289 1.3%
 
7200 211 1.0%
 
4800 145 0.7%
 
7500 142 0.7%
 
8400 116 0.5%
 
3600 111 0.5%
 
4500 111 0.5%
 
5100 109 0.5%
 
Other values (8679) 19595 90.7%
 
ValueCountFrequency (%) 
651 1 < 0.1%
 
659 1 < 0.1%
 
660 1 < 0.1%
 
748 2 < 0.1%
 
750 4 < 0.1%
 
ValueCountFrequency (%) 
871200 1 < 0.1%
 
858132 1 < 0.1%
 
560617 1 < 0.1%
 
438213 1 < 0.1%
 
434728 1 < 0.1%
 

view
Real number (ℝ≥0)

ZEROS
Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2343034285
Minimum0
Maximum4
Zeros19489
Zeros (%)90.2%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum4
Range4
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7663175693
Coefficient of variation (CV)3.270620384
Kurtosis10.89302168
Mean0.2343034285
Median Absolute Deviation (MAD)0.4225548992
Skewness3.395749593
Sum5064
Variance0.587242617
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 19489 90.2%
 
2 963 4.5%
 
3 510 2.4%
 
1 332 1.5%
 
4 319 1.5%
 
ValueCountFrequency (%) 
0 19489 90.2%
 
1 332 1.5%
 
2 963 4.5%
 
3 510 2.4%
 
4 319 1.5%
 
ValueCountFrequency (%) 
4 319 1.5%
 
3 510 2.4%
 
2 963 4.5%
 
1 332 1.5%
 
0 19489 90.2%
 

waterfront
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size169.0 KiB
0
21450
1
 
163
ValueCountFrequency (%) 
0 21450 99.2%
 
1 163 0.8%
 

yr_built
Real number (ℝ≥0)

Distinct count116
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1971.005136
Minimum1900
Maximum2015
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum1900
5-th percentile1915
Q11951
median1975
Q31997
95-th percentile2011
Maximum2015
Range115
Interquartile range (IQR)46

Descriptive statistics

Standard deviation29.3734108
Coefficient of variation (CV)0.01490275711
Kurtosis-0.6574075047
Mean1971.005136
Median Absolute Deviation (MAD)24.56566156
Skewness-0.4698053988
Sum42599334
Variance862.7972622
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1900. 1900.5 1904.5 1909.5 1910.5 ... 2009.5 2011.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2014 559 2.6%
 
2006 454 2.1%
 
2005 450 2.1%
 
2004 433 2.0%
 
2003 422 2.0%
 
2007 417 1.9%
 
1977 417 1.9%
 
1978 387 1.8%
 
1968 381 1.8%
 
2008 367 1.7%
 
Other values (106) 17326 80.2%
 
ValueCountFrequency (%) 
1900 87 0.4%
 
1901 29 0.1%
 
1902 27 0.1%
 
1903 46 0.2%
 
1904 45 0.2%
 
ValueCountFrequency (%) 
2015 38 0.2%
 
2014 559 2.6%
 
2013 201 0.9%
 
2012 170 0.8%
 
2011 130 0.6%
 

yr_renovated
Real number (ℝ≥0)

ZEROS
Distinct count70
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.4022579
Minimum0
Maximum2015
Zeros20699
Zeros (%)95.8%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum2015
Range2015
Interquartile range (IQR)0

Descriptive statistics

Standard deviation401.67924
Coefficient of variation (CV)4.759105384
Kurtosis18.70115212
Mean84.4022579
Median Absolute Deviation (MAD)161.6658804
Skewness4.549493367
Sum1824186
Variance161346.2119
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 967. 1937. 1954.5 1976.5 ... 2007.5 2012.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 20699 95.8%
 
2014 91 0.4%
 
2013 37 0.2%
 
2003 36 0.2%
 
2000 35 0.2%
 
2007 35 0.2%
 
2005 35 0.2%
 
2004 26 0.1%
 
1990 25 0.1%
 
2006 24 0.1%
 
Other values (60) 570 2.6%
 
ValueCountFrequency (%) 
0 20699 95.8%
 
1934 1 < 0.1%
 
1940 2 < 0.1%
 
1944 1 < 0.1%
 
1945 3 < 0.1%
 
ValueCountFrequency (%) 
2015 16 0.1%
 
2014 91 0.4%
 
2013 37 0.2%
 
2012 11 0.1%
 
2011 13 0.1%
 

zipcode
Real number (ℝ≥0)

Distinct count70
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98077.9398
Minimum98001
Maximum98199
Zeros0
Zeros (%)0.0%
Memory size169.0 KiB
Mini histogram

Quantile statistics

Minimum98001
5-th percentile98004
Q198033
median98065
Q398118
95-th percentile98177
Maximum98199
Range198
Interquartile range (IQR)85

Descriptive statistics

Standard deviation53.50502626
Coefficient of variation (CV)0.0005455357888
Kurtosis-0.8534788732
Mean98077.9398
Median Absolute Deviation (MAD)46.72127898
Skewness0.4056612082
Sum2119758513
Variance2862.787835
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[98001. 98001.5 98002.5 98004.5 98005.5 ... 98151.5 98183. 98193. 98198.5 98199. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
98103 602 2.8%
 
98038 590 2.7%
 
98115 583 2.7%
 
98052 574 2.7%
 
98117 553 2.6%
 
98042 548 2.5%
 
98034 545 2.5%
 
98118 508 2.4%
 
98023 499 2.3%
 
98006 498 2.3%
 
Other values (60) 16113 74.6%
 
ValueCountFrequency (%) 
98001 362 1.7%
 
98002 199 0.9%
 
98003 280 1.3%
 
98004 317 1.5%
 
98005 168 0.8%
 
ValueCountFrequency (%) 
98199 317 1.5%
 
98198 280 1.3%
 
98188 136 0.6%
 
98178 262 1.2%
 
98177 255 1.2%
 

Correlations

Missing values

Sample

First rows

bathroomsbedroomsconditiondatefloorsgradeidlatlongpricesqft_abovesqft_basementsqft_livingsqft_living15sqft_lotsqft_lot15viewwaterfrontyr_builtyr_renovatedzipcode
01.003.0320141013T0000001.07712930052047.5112-122.257221900.0118001180134056505650001955098178
12.253.0320141209T0000002.07641410019247.7210-122.319538000.021704002570169072427639001951199198125
21.002.0320150225T0000001.06563150040047.7379-122.233180000.077007702720100008062001933098028
33.004.0520141209T0000001.07248720087547.5208-122.393604000.010509101960136050005000001965098136
42.003.0320150218T0000001.08195440051047.6168-122.045510000.0168001680180080807503001987098074
54.504.0320140512T0000001.011723755031047.6561-122.0051225000.03890153054204760101930101930002001098053
62.253.0320140627T0000002.07132140006047.3097-122.327257500.0171501715223868196819001995098003
71.503.0320150115T0000001.07200800027047.4095-122.315291850.0106001060165097119711001963098198
81.003.0320150415T0000001.07241460012647.5123-122.337229500.010507301780178074708113001960098146
92.503.0320150312T0000002.07379350016047.3684-122.031323000.0189001890239065607570002003098038

Last rows

bathroomsbedroomsconditiondatefloorsgradeidlatlongpricesqft_abovesqft_basementsqft_livingsqft_living15sqft_lotsqft_lot15viewwaterfrontyr_builtyr_renovatedzipcode
216032.503.0320140825T0000002.08785214004047.5389-121.881507250.0227002270227055365731002003098065
216042.003.0320150126T0000003.08983420136747.5699-122.288429000.0149001490140011261230002014098144
216052.504.0320141014T0000002.09344890021047.5137-122.167610685.0252002520252060236023002014098056
216063.504.0320150326T0000002.09793600042947.5537-122.3981007500.026009103510205072006200002009098136
216072.503.0320150219T0000002.08299780002147.5773-122.409475000.011801301310133012941265002008098116
216082.503.0320140521T0000003.0826300001847.6993-122.346360000.0153001530153011311509002009098103
216092.504.0320150223T0000002.08660006012047.5107-122.362400000.0231002310183058137200002014098146
216100.752.0320140623T0000002.07152330014147.5944-122.299402101.0102001020102013502007002009098144
216112.503.0320150116T0000002.0829131010047.5345-122.069400000.0160001600141023881287002004098027
216120.752.0320141015T0000002.07152330015747.5941-122.299325000.0102001020102010761357002008098144